Mining Open Source Software (OSS) Data Using Association Rules Network
نویسندگان
چکیده
The Open Source Software(OSS) movement has attracted considerable attention in the last few years. In this paper we report our results of mining data acquired from SourceForge.net, the largest open source software hosting website. In the process we introduce Association Rules Network(ARN), a (hyper)graphical model to represent a special class of association rules. Using ARNs we discover important relationships between the attributes of successful OSS projects. We verify and validate these relationships using Factor Analysis, a classical statistical technique related to Singular Value Decomposition(SVD).
منابع مشابه
Warehousing and Studying Open Source Versioning Metadata
In this paper, we describe the downloading and warehousing of Open Source Software (OSS) versioning metadata from SourceForge, BerliOS Developer, and GNU Savannah. This data enables and supports research in areas such as software engineering, open source phenomena, social network analysis, data mining, and project management. This newly-formed database containing Concurrent Versions System (CVS...
متن کاملAn Adaptive Filter-Framework for the Quality Improvement of Open-Source Software Analysis
Knowledge mining in Open-Source Software (OSS) brings a great benefit for software engineering (SE). The researchers discover, investigate, and even simulate the organization of development processes within open-source communities in order to understand the community-oriented organization and to transform its advantages into conventional SE projects. Despite a great number of different studies ...
متن کاملAntecedents of open source software defects: A data mining approach to model formulation, validation and testing
This paper develops tests and validates a model for the antecedents of open source software (OSS) defects, using Data and Text Mining. The public archives of OSS projects are used to access historical data on over 5,000 active and mature OSS projects. Using domain knowledge and exploratory analysis, a wide range of variables is identified from the process, product, resource, and end-user charac...
متن کاملPredicting OSS Development Success: A Data Mining Approach
Open Source Software (OSS) has reached new levels of sophistication and acceptance by users and commercial software vendors. This research creates tests and validates a model for predicting successful development of OSS projects. Widely available archival data was used for OSS projects from Sourceforge. net. The data is analyzed with multiple Data Mining techniques. Initially three competing mo...
متن کاملProject Development Analysis of the OSS Community Using ST Mining
The OSS (Open Source Software) phenomenon is a novel, widely growing approach to develop both applications and infrastructure software recently. The fast growth of the community increases the interests in OSS related research. Accurate prediction of the project success is one of the interesting studies in OSS research. We propose to use the ST (Spatial Temporal) data mining techniques to predic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003